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Computing inter-autonomous System MPLS traffic 

ENGINEERING LSP PATHS 

Background Of the Invention 

5 The present invention relates to data networking and more particularly to path 

computation in certain types of situation. 

MPLS (Multi-Protocol Label Switching) Traffic Engineering has been developed 
to meet data networking requirements such as guaranteed available bandwidth. MPLS 
Traffic Engineering exploits modem label switching techniques to build guaranteed 
10 bandwidth end-to-end tunnels through an IP/MPLS network of labels switched routers 
(LSRs). These tunnels are a type of label switched path (LSP) and thus are generally 
referred to as MPLS Traffic Engineering LSPs. 

Estabhshment of an MPLS Traffic Engineering LSP fi*om an LSP head-end to an 
LSP tail-end involves computation of a path through the network of LSRs. Optimally, 

15 the computed path is the "shortest" path, as measured in some metric, that satisfies all of 
the relevant constraints such as e.g., required bandwidth, availability of backup bypass 
tunnels for each Unk and node included in the path, etc. Path computation can either be 
performed by the head-end LSR or by some other entity operating as a path computation 
element (PCE). The head-end (or PCE) exploits its knowledge of network topology and 

20 resources available on each link to perform the path computation according to the LSP 
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Traffic Engineering constraints. Various path computation methodologies are available 
including CSPF (constrained shortest path first). 

Up until now, MPLS Traffic Engineering LSPs have been configured within a 
single Autonomous System (AS) or Interior Gateway Protocol (IGP) area. The term 
"Autonomous SystemV generally refers to a group of routers within a network that are 
subject to a common authority and use the same intradomain routing protocol. It is now 
desirable to extend MPLS Traffic Engineering LSPs across AS boundaries. This would 
greatly improve traffic management and quaUty of service across multiple service 
providers over what has been achieved using prior art Border Gateway Protocol (BGP)- 
based techniques. 

One difficulty that arises in achieving this goal is that path computation at the 
LSP head-end requires knowledge of network topology and resources across the entire 
network between the head-end and the tail-end. Yet service providers typically do not 
15 share this information with each other across AS borders. Neither the head-end nor any 
PCS will have sufficient knowledge to compute a path. Prior art MPLS Traffic 
Engineering path computation methodologies thus do not operate in an inter- AS context. 

A similar problem arises in computing the paths of MPLS Traffic Engineering 
LSPs across what are referred to as "areas." An area is a collection of routers that share 
20 fiiU network topology information with each other. To improve routing scalability, a 
service provider may divide an AS into multiple areas. Network topology and resource 



5 



10 



Attorney Docket No. CISCP850 
Client Reference No. 7787 



PATENT 



information do not, however, flow across area boundaries even though a single service 
provider may operate all the IGP areas. Like in the inter- AS case, the standard MPLS 
Traffic Engineering path computation techniques cannot compute inter-area paths 
5 because overall network topology and resource information may not be available at any 
one node. 

What is needed are systems and methods for computing the paths of MPLS 
Traffic Engineering LSPs across area and/or AS boundaries. 
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SUMMARY OF THE INVENTION 



Embodiments of the present invention provide systems and methods for 
computing the paths of MPLS Traffic Engineering LSPs across Autonomous System 
and/or area boundaries. A distributed path computation algorithm exploits muUiple path 
computation elements (PCEs) to develop a virtual shortest path tree (VSPT) resulting in 
computation of an end-to-end optimal (shortest) path. In some implementations, the 
VSPT is computed recursively across all the Autonomous Systems and/or areas between 
the head-end and tail-end of the Traffic Engineering LSP. 

Further understanding of the nature and advantages of the inventions herein may 
be realized by reference to the remaining portions of the specification and the attached 
drawings. 



Attorney Docket No. CISCP850 
Client Reference No. 7787 



4 



PATENT 



Brief Description Of The Drawings 

Fig. 1 depicts an inter- Autonomous System path computation scenario according 
to one embodiment of the present invention. 

5 Fig. 2 is a flow chart describing steps of computing a path of an MPLS Traffic 

Engineering LSP across muhiple Autonomous Systems according to one embodiment of 
the present invention. 

Figs. 3 A-3D depict evolution of a virtual shortest path tree across multiple 
Autonomous Systems according to one embodiment of the present invention. 

10 Fig. 4 depicts an inter-area path computation scenario according to one 

embodiment of the present invention. 

Fig. 5 is a flow chart describing steps of computing a path of an MPLS Traffic 
Engineering LSP across multiple areas according to one embodiment of the present 
invention. 

15 Fig. 6 depicts a network device suitable for implementing embodiments of the 

present invention. 
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DESCRIPTION OF SPECIFIC EMBODIMENTS 

The present invention will be described with reference to a representative network 
environment and employs a certain combination of network protocols to forward data 
5 through.the network. The Unks may be implemented using any type of physical medium 
such as, e.g., an optical medium, wireless medium, twisted pair, etc. Links may also be 
logical connections that give the connected nodes the property of adjacency in view of 
the operative networking protocols. 

In one embodiment, the nodes of such a network interoperate in a manner 
10 specified by various protocols including e.g., TCP/IP and protocols defined but not 
limited to by the following documents: 

E. Rosen, et al., "Multiprotocol Label Switching Architecture," RFC 3031, 
Intemet Engineering Task Force, January 2001. 

Braden, et al. "Resource ReSerVation Protocol (RSVP)-Version 1 Functional 
15 Specification," RFC 2205, Intemet Engineering Task Force, September 1997. 

Awduche, et al, "Requirements for Traffic Engineering Over MPLS," RFC 
2702, Litemet Engineering Task Force, September 1999. 
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Berger, et al, "Generalized MPLS Signaling - RSVP-TE Extensions" RFC 
3473, Internet Engineering Task Force, January 2003. 

Vasseur, et al. "RSVP Path Computation Request and Reply Messages," Internet 
5 Draft, Internet Engineering Task Force, June 2002. 

Lindem, et al., "Extensions to OSPF for Advertising Optional Router 
Capabilities," Internet Draft, Internet Engineering Task Force, October 2003. 

Vasseur, et al., "OSPF Traffic Engineering Capability TLVs," Internet Draft, 
Internet Engineering Task Force, October 2002. 

10 The above documents are incorporated herein by reference in their entirety for all 

* 

purposes. 

In one embodiment, the nodes of example networks described herein are IP 
routers that implement multiprotocol label switching (MPLS) and operate as label 
switched routers (LSRs). In one simple MPLS scenario, at the ingress to the network, a 
15 label is assigned to each incoming packet based on its forwarding equivalence class 
before forwarding the packet to the next hop node. At each intermediate node, a 
forwarding selection and a new substitute label are determined by using the label found in 
the incoming packet as a reference to a label forwarding table that includes this 
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information. At the network egress (or one hop prior), a forwarding decision is made 
based on the incoming label but optionally no label is included when the packet is sent on 
to the next hop. 

5 The paths taken by packets that traverse the network in this manner are 

preconfigured and referred to as label switched paths (LSPs). Establishment of an LSP 
requires computation of a path, signaling along the path, and modification of forwarding 
tables along the path. MPLS Traffic Engineering estabUshes LSPs that have guaranteed 
bandwidth under certain conditions. 

10 Embodiments of the present invention will be described with reference to two 

principal examples that involve computation of an optimal path which in the examples is 
the shortest path. The first example involves computing the path of an MPLS Traffic 
Engineering LSP through three Autonomous Systems. An Autonomous System (AS) is 
herein defined to be a group of routers within a network that are subject to a common 

1 5 authority and use the same intradomain routing protocol. 

The second example involves computing the path of an MPLS Traffic 
Engineering LSP through three "areas." An area is a collection of routers that share fiill 
network topology information with each other but not necessarily with routers outside the 
area even those that with which they share common administrative control. The term 
20 area as used herein also encompasses the term "level" which has a similar meaning for 
networks that employ IS-IS as their IGP (interior gateway protocol). These examples are 
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merely representative. Furthermore, the techniques described with reference to inter- AS 
path computation may be used in the context of inter-area path computation and, under 
certain conditions, the techniques described with reference to inter-area path computation 
5 may be used for inter- AS path computation. 



Embodiments of the present invention provide a distributed path computation 
algorithm employing multiple path computation elements. Generally, there is one path 
computation element per AS or area used for the computation of specific LSP paths. The 
algorithm is herein referred to as the virtual shortest path tree (VSPT) algorithm. One 
10 implementation operates as a recursive backward path computation. 



The following discussion of terminology and assumptions will refer to a multiple 
AS case but it will be understood that wherever Autonomous Systems are referred to, 
areas may be used also. The VSPT algorithm is applicable to tree-based multi-AS or 
multi-area topologies and can also be extended to arbitrary multi-AS topologies. This 
15 type of topology involves an arbitrary set of Autonomous Systems with the constraint 
that there is only one sequence of Autonomous Systems that connects any pair of 
Autonomous Systems. 



A path computation element (PCE) is an entity having the capability to compute 
paths between any nodes in an AS and communicate with other path computation 
20 elements in other Autonomous Systems. 



Attorney Docket No. CISCP850 
Client Reference No. 7787 



Patent 



A path computation client (PCC) is a requesting entity, e.g., a label-switched 
router (LSR) that sends a path computation request to a path computation element. When 
a path computation element sends a request to another path computation element, it acts 
5 as a path computation client. 

Now assume a topology with n Autonomous Systems: ASl, AS2, ASS, .... ASn. 
Each AS has a dedicated path computation element PCEi which can either be statically 
configured on the path computation cUents or dynamically discovered by means of IGP 
extensions. Also, the border routers that interconnect each AS will be defined. An entry 
10 border router of ASi is a border router that connects ASi-1 to ASi. An exit border router 
of ASi is a border router that intercormects ASi to ASi+1 . hi the case of areas rather than 
autonomous systems since the routers are under a common authority, a single router may 
in fact serve as the exit border router of one area and the entry border router of another 
area. 

15 For each ASi, we can define a set, X-en(i), of entry border routers identified as 

ASBR-en(k,i), the kth entry border router of ASi. Similarly, there is a set X-ex(i) of exit 
border routers identified as ASBR-ex(k,i), the kth exit border router of ASL 

A virtual shortest path tree (VSPT) is established recursively by communication 
among the path computation elements. In one embodiment, at successive recursive 
20 stages a VSPTi is developed by PCEi and then transferred to PCEi- 1 . The VSPTi is 
rooted at the MPLS Traffic Engineering LSP destination or tail-end and extends to the 
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entry border routers, ASBR-en(l,i), ASBR-en(2,i) ASBR-en(X-en(i),i). The tree 

includes a path segment from each of these entry border routers to the tree root that is a 
shortest computed path that satisfies a set of constraints such as bandwidth, affinities, etc. 
5 Note that only entry border routers having connectivity to an exit border router of ASi-1 
need be considered. 

Fig. 2 is a flow chart describing steps of computing a path of an inter- AS MPLS 
Traffic Engineering LSP according to one embodiment of the present invention. At a 
step 202, a path computation request is sent from the LSP. head-end (which is acting as a 

10 path computation client) to the local path computation element. The address of the local 
path computation element may be manually configured. Altematively, the path 
computation element may advertise itself through flooding within the area/ AS. Path 
computation element discovery techniques can be in accordance with Lindem, et al., 
"Extensions to OSPF for Advertising Optional Router Capabilities," Internet Draft, 

15 Intemet Engineering Task Force, October 2003 with Vasseur, et al., "OSPF Traffic 
Engineering Capability TLVs," Intemet Draft, Intemet Engineering Task Force, 
October 2002. The request can be made in accordance with a protocol specified in 
Vasseur, et al. "RSVP Path Computation Request and Reply Messages," Intemet Draft, 
June 2002. At step 204, the path computation request is passed to the path computation 

20 ^ element in every AS on the way to the LSP tail-end. Knowledge of the other path 
computation element addresses may be by way of static configuration or BGP 
advertisements as could be readily devised by one of skill in the art. N is the number of 
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Autonomous Systems on the way to the LSP tail-end, one can say that the path 
computation request is uUimately received by a PCE-N in the AS of the tail-end. 

A backward recursive path computation begins, A step 206 sets N to the number 
5 of Autonomous Systems and an index variable n is set equal to N. A step 208 is the first 
step inside the recursion loop. At step 208, PCEn computes VSPTn. VSPTn is a shortest 
path tree rooted at the LSP tail-end and includes a path from this tail-end to every ASBR- 
en(k,n). This may be computed using the CSPF (constrained shortest path first) 
algorithm as known in the art or any other suitable algorithm. Links between entry 
10 border routers of ASn should be taken into account when computing VSPTn. A step 210 
sends information specifying VSPTn from PCEn to PCEn-1. The VPST may or may not 
be specified in such a way that hops internal to an AS and their costs are specified. A 
step 212 decrements n. 

At step 214, PCEn concatenates the VSPT it received from PCEn+1 with the 
15 topology of ASn. In one implementation, PCEn, before concatenating the ASn topology 
with VSPTn+1 may invoke a local CSPF algorithm on the interconnections between all 
ASBR-ex(k,i) and ASBR-en(k',i+l). A step 216 tests whether n equals 1, i.e., whether 
the algorithm is about to visit the path computation element in the AS of the LSP head- 
end. If n does not equal 1, another iteration begins at step 208 with the computation of 
20 VSPTn. 
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If step 216 determines that n=l, then at step 218, PCEl concatenates the received 
VSPT2 to the topology of ASl, computes (using, e.g.,.CSPF) the shortest path based on 
the concatenated topology, and then sends information specifying the shortest path to the 
5 requesting head-end. If multiple equal-cost paths are found, PCEl may provide some or 
all of them to the requesting head-end. Other situations where PCEl may return more 
than one path include, e.g., where the head-end requests the computation of N diverse 
paths. These diverse paths may or may not have equal costs. 

A specific example of the application of the just-described procedure will be 
explained with reference to Fig. 1 and Figs. 3 A-3D. Fig. 1 depicts an inter- Autonomous 
System path computation scenario. There are three Autonomous Systems, ASl, AS2, 
and ASS. A path is to be computed between a head-end 102 in ASl and a tail-end 104 in 
AS3. Fig. 1 shows 8 border routers positioned to connect the Autonomous Systems: 
ASl, AS2, .... AS3. Additionally, Fig. 1 shows ASl as including routers 106, 108, and 
110, AS2 as including routers 1 12 and 1 14, and AS3 as including router 104 and 1 16. 

All of the border routers are BGP (Border Gateway Protocol) peers. The 
protocols used for routing within an AS (e.g., an IGP), do not operate on the links 
connecting the border routers. Within an AS, an IGP protocol such as the well-known 
IS-IS or OSPF operates. ASBRl, ASBR8, and ASBR9 operate as path computation 
20 elements for their respective Autonomous Systems. AUlinkshaveacost of one. At least 
one path satisfying the set of required Traffic Engineering constraints (bandwidth, 
affinities, etc.) can be found between 1) the head-end 102 and every exit border router of 
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ASl, 2) every entry border router of AS2 and every exit border router of AS2, and 3) 
every entry border router of ASS and the destination. 

The VSPT and shortest path is found as follows. A path computation request is 
5 sent from head-end 102 to ASBRl. ASBRl is known as a path computation element to 
head-end 102 either by, e.g., static configuration or by use of an extension to the locally 
operative IGP protocol. The path computation request is relayed from ASBRl to ASBR8 
and then on to ASBR9. ASBR9 computes a VSPT3 interconnecting ASBR9 and 
ASBRl 0 with tail-end 104. This VSPT3 is depicted in Fig. 3 A. Intermediate hops such 
10 as ones involving router 1 16 are omitted from the representation of VSPT3 to reflect their 
omission from the information specifying VSPT3 that is transmitted to ASBR2. VSPT3 
includes a path 302 that includes one hop from ASBR9 to tail-end 104 and has a cost of 
1. VSPT3 fiirther includes a path 304 that includes hops from ASBRIO to router 1 16 (not 
shown) on to tail-end 104. Path 304 has a cost of 2. 

15 Upon receipt of VSPT3, ASBR8 concatenates the topology of AS2 with VSPT3 

resulting in the mesh depicted in Fig. 3B. ASBR8 uses this topology to determine 
VSPT2 as shown in Fig. 3C. Only the costs and the endpoints are shown in Fig. 3C. A 
path 306 extends from ASBR4 through router 1 12, ASBR7, and ASBR9 to tail-end 104. 
A path 308 extends from ASBR5 through ASBR4, router 1 12, ASBR7, and ASBR9 to 

20 tail-end 104. A path 310 extends from ASBR6 through router 1 14, ASBR7, and ASBR9 
to tail-end 104. Information specifying VSPT2 is sent to ASBRl. 
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ASBRl concatenates the topology of ASl with VSPT2 to obtain a mesh shown in 
Fig. 3D. ASBRl can then find a shortest path. Here there are multiple shortest paths 
having cost 7. One shortest path goes fi-om head-end 102 through router 108, ASBRl, 
5 ASBR4, router 1 1 2, ASBR7, and ASBR9. Another shortest path traverses router 1 1 0, 
ASBR3, ASBR6, router 1 14, ASBR7, and ASBR9 before reaching tail-end 104. One or 
more of the shortest paths are reported to head-end 102. 

Development of VSPTs and selection of the shortest path(s) involves comparing 
the costs of paths. Since links making up the paths may be in Autonomous Systems 

10 operated by different service providers, the metrics used for measuring cost may not be 
the same. There are various ways of addressing this issue. One way is to simply 
standardize the metric (e.g, the standard Traffic Engineering metric) among service 
providers that carry each other's MPLS Traffic Engineering LSPs. In an aUemative 
implementation, a mapping mechanism is defined to translate costs in one metric to costs 

15 in another metric. For example, if cost is a fimction of only link bandwidth, tables 
correlating standard link bandwidth values (e.g., DS-1, DS-3, 0C3, OC48, OC192) to 
local AS cost values could be passed among path computation elements. A PCEi that is 
transmitting VSPT information to PCEi-1 can use this type of table to translate cost 
information of the VSPT into the metric of PCEi-1 before transmitting. Alternatively, the 

20 translation could be performed by PCEi-1 prior to computing VSPTi-1 . 

The above-described techniques can also be applied to inter-area path 
computation rather than inter- AS path computation. It will ofl;en be the case that a single 
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border router is directly connected in two areas and has access to IGP link information 
for both areas. In many common situations, the above-described VSPT development 
technique can be modified to reduce signaling. 

Fig. 4 depicts an inter-area path computation scenario according to one 
embodiment of the present invention. There are three areas, Area 1, Area 0, and Area 2. 
The path of an MPLS Traffic Engineering LSP from a head-end 402 in Area 1 to a tail- 
end 404 in Area 2 is to be computed. Area border routers ABRl and ABR2 interconnect 
Area 1 and Area 0. Area border routers ABRl ' and ABR2' interconnect Area 0 and Area 
2. Area 1 further includes routers 404 and 406. Area 0 further includes routers 408, 410, 
and 412. Area 2 further includes a router 414. 

Fig. 5 is a flow chart describing steps of computing a path of an MPLS Traffic 
Engineering LSP across multiple areas according to one embodiment of the present 
invention. At step 502, head-end 402 sends a path computation request to ABRl to 
15 compute a path to tail-end 404. ABRl computes shortest paths from head-end 402 to 
ABRl and fi:-om head-end 402 to ABR2 at step 504. These shortest paths make up a 
VSPTl. This VSPTl is sent to ABR2' along with a path computation request at step 506. 
At step 508, ABR2' concatenates VSPTl with the topologies of Area 0 and Area 2. Then 
at step 510, ABR2' computes a shortest path based on the concatenated topology. This 
20 shortest path is retumed to head-end node 402 at step 512. 
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Fig. 6 depicts a network device 600 that may be used to implement, e.g., any of 
the routers of Fig. 1, Figs. 3A-3D, or Fig. 4 and/or perform any of the steps of Fig. 2 or 
Fig. 5. In one embodiment, network device 600 is a programmable machine that may 
5 be implemented in hardware, software or any combination thereof. A processor 602 
executes code stored in a program memory 604. Program memory 604 is one example 
of a computer-readable medium. Program memory 604 can be a volatile memory. 
Another form of computer-readable medivun storing the same codes would be some type 
of non-volatile storage such as floppy disks, CD-ROMs, DVD-ROMs, hard disks, flash 
10 memory, etc. A carrier wave that carries the code across a network is another example 
of a computer-readable medium. 

Network device 600 interfaces with physical media via a plurality of linecards 
606. Linecards 606 may incorporate Ethemet interfaces, DSL interfaces, Gigabit 
Ethernet interfaces, 10-Gigabit Ethemet interfaces, SONET interfaces, etc. As packets 
15 are received, processed, and forwarded by network device 600, they may be stored in a 
packet memory 608. Network device 600 implements all of the network protocols and 
extensions thereof described above as well as the data networking features provided by 
the present invention. 

In one implementation, control plane operations such as the path computation 
20 operations described above are controlled and signaled by processor 602 while 

forwarding tables are maintained on linecards 606. The present invention is, however, 
not limited to a distributed architecture. To implement functionality according to the 
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present invention, linecards 606 may incorporate processing and memory resources 
similar to those discussed above in connection with the network device as a whole. 

It is understood that the examples and embodiments that are described herein are 
5 for illustrative purposes only and that various modifications and changes in light thereof 
will be suggested to persons skilled in the art and are to be included within the spirit and 
purview of this application and scope of the appended claims and their full scope of 
equivalents. 
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